Multi-Stream Deep Similarity Learning Networks for Visual Tracking

نویسندگان

Kunpeng Li

Yu Kong

Yun Fu

چکیده

Visual tracking has achieved remarkable success in recent decades, but it remains a challenging problem due to appearance variations over time and complex cluttered background. In this paper, we adopt a tracking-by-verification scheme to overcome these challenges by determining the patch in the subsequent frame that is most similar to the target template and distinctive to the background context. A multi-stream deep similarity learning network is proposed to learn the similarity comparison model. The loss function of our network encourages the distance between a positive patch in the search region and the target template to be smaller than that between positive patch and the background patches. Within the learned feature space, even if the distance between positive patches becomes large caused by the appearance change or interference of background clutter, our method can use the relative distance to distinguish the target robustly. Besides, the learned model is directly used for tracking with no need of model updating, parameter fine-tuning and can run at 45 fps on a single GPU. Our tracker achieves state-of-the-art performance on the visual tracking benchmark compared with other recent real-time-speed trackers, and shows better capability in handling background clutter, occlusion and appearance change.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Tracking: Visual Tracking Using Deep Convolutional Networks

In this paper, we study discriminatively trained deep convolutional networks for the task of visual tracking. Our tracker utilizes both motion and appearance features extracted from a pre-trained dual stream deep convolution network. By using optical flow and deep networks to implement a dual appearance and motion stream to inform tracking, our tracker outperforms current state of the art track...

متن کامل

Tracking of Humans in Video Stream Using LSTM Recurrent Neural Network

In this master thesis, the problem of tracking humans in video streams by using Deep Learning is examined. We use spatially supervised recurrent convolutional neural networks for visual human tracking. In this method, the recurrent convolutional network uses both the history of locations and the visual features from the deep neural networks. This method is used for tracking, based on the detect...

متن کامل

A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...

متن کامل

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

This paper discusses the problem of tracking from a deep learning approach. This experiment takes cues from how the brain is modeled to create deep convolutional networks that mimic how the human brain tracks objects. By using optical flow and deep networks to implement a dual appearance and motion stream, our tracker outperforms current state of the art methods.

متن کامل

Learning Dual Multi-Scale Manifold Ranking for Semantic Segmentation of High-Resolution Images

Semantic image segmentation has recently witnessed considerable progress by training deep convolutional neural networks (CNNs). The core issue of this technique is the limited capacity of CNNs to depict visual objects. Existing approaches tend to utilize approximate inference in a discrete domain or additional aides and do not have a global optimum guarantee. We propose the use of the multi-lab...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

Multi-Stream Deep Similarity Learning Networks for Visual Tracking

نویسندگان

چکیده

منابع مشابه

Deep Tracking: Visual Tracking Using Deep Convolutional Networks

Tracking of Humans in Video Stream Using LSTM Recurrent Neural Network

A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Learning Dual Multi-Scale Manifold Ranking for Semantic Segmentation of High-Resolution Images

عنوان ژورنال:

اشتراک گذاری